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Many of the engineering disciplines have organized technical data into handbooks or data com- 
pendiums. These reference works are used by engineers for a large variety of engineering tasks 
throughout an engineered product’s entire life cycle. Electronic engineers, for example, are able 
to use compendiums of failure data in both the design and maintenance of electronic components, 
circuits, and systems. Unfortunately, there is a distinct lack of such engineering aids for use by 
members of the software engineering community. 

As part of its role as a software engineering information analysis center, the Data and Analysis 
Center for Software is currently designing a software engineering data compendium. Hopefully, 
this compendium will serve as a start in filling in some of the gaps that exist in software engineer- 
ing data. 

The paper will highlight some of the more critical areas that are considered during the design of 
an engineering reference guide such as a data compendium. These areas include; 

• Identifying potential subject areas for the compendium. 

• Identifying and evaluating potential data sources. 

• Identifying key data eleihMfs and determining the-most- effective^ methods of data 
organization and presentation for use by software engineers. 

• Summarizing the data at a level that minimizes volume while optimizing information 
value. 

• Using automated tools to effectively and efficiently manage, organize, and report the 
data. 
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DESIGNING A S/Vl DATA COMPENDIUM 

§ ENGINEERING DISCIPLINES USUALLY HAVE HANDBOOKS 

• EXAMPLE : 

ELECTRONIC ENGINEERS - COMPONENT FAILURE HANDBOOKS 

• SOFTWARE ENGINEERS DO NOT HAVE HANDBOOKS OR 

DATA COMPENDIUMS 

• AS AN lAC^ DACS IS DESIGNING A SOFTWARE ENGINEERING 

DATA COMPENDIUM 

• GOAL: 

TO SERVE AS A START FOR A SOFTWARE ENGINEERING 
HANDBOOK 


71 


L. Duvall 
IITRI 
4 of 8 



CRITICAL AREAS IN DATA COMPENDIUn DESIGN 


• IDENTIFYING SUBJECT AREAS 

• IDENTIFYING AND EVALUATING DATA SOURCES 

• IDENTIFYING KEY DATA ELEMENTS AND METHODS 

OF ORGANIZATION 

• SUMMARIZING THE DATA AT AN OPTIMUM LEVEL 

• USING AUTOMATED TOOLS 


IDENTIFYING POTENTIAL SUBJECT AREAS 

• POSSIBLE SUBJECT AREAS 

- failure/test data 

- cost/productivity data 

- COMPONENT DATA 

- project/management data 
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IDENTIFYING & EVALUATING POTENTIAL DATA SOURCES 

• POTENTIAL SOURCES 

- BSDS (6spr/smn related datasets) 

- RADC PRODUCTIVITY DATABASE (PRODUCTIVITY) 

- NASA SEL DATABASE (PRODUCTIVITY, COMPONENT DATA, 

PROJECT DATA, SPR/sMN, TEST RESULTS) 

- faa/csc database (productivity, spr/smn) 

• EVALUATION CRITERIA 

- AVAILABILITY 

- COMPLETENESS 

- APPLICABILITY 

- CONSISTENCY 

- MEDIA (machine READABLE) 
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IDENTIFYING KEY DATA ELEf€NTS & METHODS OF ORGANIZATION 


• KEY DATA ELEMENTS DERIVED FROM SUBJECT AREAS 



• ORGANIZATION DEPENDENT ON SUBJECT & SUMMARIZATION 
LEVELS 


SUMMARIZING DATA AT OPTIMUM LEVEL 

• DATABASE "dUMPS" RESULT IN INFORMATION OVERLOAD 

• HIGH SUMMARIZATION LEVELS RESULT IN INFORMATION LOSS 

• BALANCE BETWEEN INFORMATION OVERLOAD & LOSS MUST BE BALANCED 


USING AUTOMATED TOOLS 

• DATABASE MANAGEMENT SYSTEMS - MDQS 

• STATISTICAL PACKAGES - SPSS 

• TEXT PROCESSING TOOLS - RUNOFF 
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AVERAGE PROGRAMMER HOURS 




COMPONENT TYPE 

String Processing 

Scientific 
Command & Control 

Business & Financial 

Database Application 





